Optical Character Recognition for Marathi Text Newsprint
نویسندگان
چکیده
منابع مشابه
Segmentation Based Optical Character Recognition for Handwritten Marathi characters
Valuable ancient documents like historical books, old scripts etc. are available in specific regional languages. Problems occur when those documents have to be preserve in digital form or to modify them. Optical Character Recognition is used to convert the scanned document word, notepad or any other format, so that we can easily edit that document. A complete OCR system for handwritten Devanaga...
متن کاملOn Optical Character Recognition of Arabic Text
Although, optical character recognition has made tremendous achievements in the area of desktop publishing, yet a huge amount of work is required to be done. Unlike Roman like languages, there are various languages possessing a large number of fonts and/or having complicated shapes. Arabic language is one of those languages, which is somewhat complicated in its construction. Although a reasonab...
متن کاملOptical Character Recognition from Text Image
Optical Character Recognition (OCR) is a system that provides a full alphanumeric recognition of printed or handwritten characters by simply scanning the text image. OCR system interprets the printed or handwritten characters image and converts it into corresponding editable text document. The text image is divided into regions by isolating each line, then individual characters with spaces. Aft...
متن کاملMulti-font Optical Character Recognition System for Printed Telugu Text
The Telugu OCR systems available in the market currently recognize only the specific fonts of Telugu. This paper describes the development of a multi-font OCR system for printed Telugu characters using Artificial Neural Networks. In this system classification of the characters is carried out using multi layer neural network Architecture.
متن کاملSegmentation-free optical character recognition for printed Urdu text
This paper presents a segmentation-free optical character recognition system for printed Urdu Nastaliq font using ligatures as units of recognition. The proposed technique relies on statistical features and employs Hidden Markov Models for classification. A total of 1525 unique high-frequency Urdu ligatures from the standard Urdu Printed Text Images (UPTI) database are considered in our study. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Applications
سال: 2013
ISSN: 0975-8887
DOI: 10.5120/10163-4903